NSF PAR Search | NSF Public Access Repository


Using Large Language Models to Promote Health Equity

https://doi.org/10.1056/AIp2400889

Pierson, Emma; Shanmugam, Divya; Movva, Rajiv; Kleinberg, Jon; Agrawal, Monica; Dredze, Mark; Ferryman, Kadija; Gichoya, Judy Wawira; Jurafsky, Dan; Koh, Pang Wei; et al (January 2025, NEJM AI)

Free, publicly-accessible full text available January 23, 2026
The TRIPOD-LLM reporting guideline for studies using large language models

https://doi.org/10.1038/s41591-024-03425-5

Gallifant, Jack; Afshar, Majid; Ameen, Saleem; Aphinyanaphongs, Yindalon; Chen, Shan; Cacciamani, Giovanni; Demner-Fushman, Dina; Dligach, Dmitriy; Daneshjou, Roxana; Fernandes, Chrystinne; et al (January 2025, Nature Medicine)

Full Text Available
MedShift: Automated Identification of Shift Data for Medical Image Dataset Curation

https://doi.org/10.1109/JBHI.2023.3275104

Guo, Xiaoyuan; Gichoya, Judy Wawira; Trivedi, Hari; Purkayastha, Saptarshi; Banerjee, Imon (August 2023, IEEE Journal of Biomedical and Health Informatics)

Automated curation of noisy external data in the medical domain has long been in high demand, as AI technologies need to be validated using various sources with clean, annotated data. Identifying the variance between internal and external sources is a fundamental step in curating a high-quality dataset, as the data distributions from different sources can vary significantly and subsequently affect the performance of AI models. The primary challenges in detecting data shifts are (1) accessing private data across healthcare institutions for manual detection and (2) the lack of automated approaches to learn efficient shift-data representations without training samples. To overcome these problems, we propose an automated pipeline called MedShift to detect top-level shift samples and evaluate the significance of shift data without sharing data between internal and external organizations. MedShift employs unsupervised anomaly detectors to learn the internal distribution and identify samples in external datasets that show significant shift, and then compares the detectors' performance. To quantify the effects of detected shift data, we train a multi-class classifier that learns internal domain knowledge and evaluate its classification performance for each class in external domains after dropping the shift data. We also propose a data quality metric to quantify the dissimilarity between internal and external datasets. We verify the efficacy of MedShift using musculoskeletal radiographs (MURA) and chest X-ray datasets from multiple external sources. Our experiments show that the proposed shift data detection pipeline can help medical centers curate high-quality datasets more efficiently.
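The idea of learning the internal distribution and then scoring external samples against it can be illustrated with a minimal sketch. This is not the MedShift pipeline: the abstract does not name its anomaly detectors, so a simple per-feature z-score distance stands in for them here. The function names (`fit_internal`, `shift_score`, `flag_shift`), the toy feature vectors, and the threshold are all hypothetical.

```python
import math

def fit_internal(features):
    """Summarize internal data as per-feature mean and standard deviation.

    Stand-in for an unsupervised anomaly detector (assumption, not MedShift).
    """
    n = len(features)
    dim = len(features[0])
    means = [sum(row[j] for row in features) / n for j in range(dim)]
    # Fall back to 1.0 when a feature is constant, to avoid dividing by zero.
    stds = [
        math.sqrt(sum((row[j] - means[j]) ** 2 for row in features) / n) or 1.0
        for j in range(dim)
    ]
    return means, stds

def shift_score(sample, means, stds):
    """Distance of one sample from the internal summary, in z-score units."""
    return math.sqrt(sum(((x - m) / s) ** 2 for x, m, s in zip(sample, means, stds)))

def flag_shift(external, means, stds, threshold):
    """Return indices of external samples whose score exceeds the threshold."""
    return [i for i, s in enumerate(external) if shift_score(s, means, stds) > threshold]

# Toy feature vectors (hypothetical); real inputs would be image embeddings.
internal = [[0.0, 0.0], [2.0, 2.0]]
external = [[1.0, 1.0], [10.0, 10.0]]

means, stds = fit_internal(internal)
flagged = flag_shift(external, means, stds, threshold=3.0)
```

In this sketch only the fitted summary (`means`, `stds`) would need to travel to the external site, which loosely mirrors the abstract's goal of detecting shift without exchanging raw data.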
Full Text Available
AI pitfalls and what not to do: mitigating bias in AI

https://doi.org/10.1259/bjr.20230023

Gichoya, Judy Wawira; Thomas, Kaesha; Celi, Leo Anthony; Safdar, Nabile; Banerjee, Imon; Banja, John D; Seyyed-Kalantari, Laleh; Trivedi, Hari; Purkayastha, Saptarshi (October 2023, The British Journal of Radiology)

Various forms of artificial intelligence (AI) applications are being deployed and used in many healthcare systems. As the use of these applications increases, we are learning how these models fail and how they can perpetuate bias. With these new lessons, we need to prioritize bias evaluation and mitigation for radiology applications, while not ignoring changes in the larger enterprise AI deployment that may affect the downstream performance of AI models. In this paper, we provide an updated review of known pitfalls causing AI bias and discuss strategies for mitigating these biases within the context of AI deployment in the larger healthcare enterprise. We describe these pitfalls by framing them in the larger AI lifecycle, from problem definition through dataset selection and curation to model training and deployment, emphasizing that bias exists across a spectrum and is a sequela of both human and machine factors.
Full Text Available
Assessing the potential of GPT-4 to perpetuate racial and gender biases in health care: a model evaluation study

https://doi.org/10.1016/S2589-7500(23)00225-X

Zack, Travis; Lehman, Eric; Suzgun, Mirac; Rodriguez, Jorge A; Celi, Leo Anthony; Gichoya, Judy; Jurafsky, Dan; Szolovits, Peter; Bates, David W; Abdulnour, Raja-Elie E; et al (January 2024, The Lancet Digital Health)

Full Text Available
“Shortcuts” Causing Bias in Radiology Artificial Intelligence: Causes, Evaluation, and Mitigation

https://doi.org/10.1016/j.jacr.2023.06.025

Banerjee, Imon; Bhattacharjee, Kamanasish; Burns, John L.; Trivedi, Hari; Purkayastha, Saptarshi; Seyyed-Kalantari, Laleh; Patel, Bhavik N.; Shiradkar, Rakesh; Gichoya, Judy (September 2023, Journal of the American College of Radiology)

Full Text Available
Toward fairness in artificial intelligence for medical image analysis: identification and mitigation of potential biases in the roadmap from data collection to model deployment

https://doi.org/10.1117/1.JMI.10.6.061104

Drukker, Karen; Chen, Weijie; Gichoya, Judy; Gruszauskas, Nicholas; Kalpathy-Cramer, Jayashree; Koyejo, Sanmi; Myers, Kyle; Sá, Rui C; Sahiner, Berkman; Whitney, Heather; et al (November 2023, Journal of Medical Imaging)

Full Text Available
Ability of artificial intelligence to identify self-reported race in chest x-ray using pixel intensity counts

https://doi.org/10.1117/1.JMI.10.6.061106

Burns, John Lee; Zaiman, Zachary; Vanschaik, Jack; Luo, Gaoxiang; Peng, Le; Price, Brandon; Mathias, Garric; Mittal, Vijay; Sagane, Akshay; Tignanelli, Christopher; et al (November 2023, Journal of Medical Imaging)

Purpose: Prior studies show convolutional neural networks predicting self-reported race using x-rays of chest, hand and spine, chest computed tomography, and mammogram. We seek an understanding of the mechanism that reveals race within x-ray images, investigating the possibility that race is not predicted using the physical structure in x-ray images but is embedded in the grayscale pixel intensities. Approach: A retrospective set of 298,827 AP/PA chest x-ray images, covering the full year 2021, from 3 academic health centers across the United States and MIMIC-CXR, labeled by self-reported race, was used in this study. The image structure is removed by summing the number of each grayscale value and scaling to percent per image (PPI). The resulting data are tested using multivariate analysis of variance (MANOVA) with Bonferroni multiple-comparison adjustment and class-balanced MANOVA. Machine learning (ML) feed-forward networks (FFN) and decision trees were built to predict race (binary Black or White and binary Black or other) using only grayscale value counts. Stratified analysis by body mass index, age, sex, gender, patient type, make/model of scanner, exposure, and kilovoltage peak setting was run to study the impact of these factors on race prediction following the same methodology. Results: Both MANOVA (F 7.38, P < 0.0001) and class-balanced MANOVA (F 2.02, P < 0.0001) reject the null hypothesis that the classes are the same with 95% confidence. The best FFN performance is limited [area under the receiver operating characteristic (AUROC) of 69.18%]. Gradient boosted trees predict self-reported race using grayscale PPI (AUROC 77.24%). Conclusions: Within chest x-rays, pixel intensity value counts alone are statistically significant indicators and sufficient for ML classification of patient self-reported race.
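The percent-per-image (PPI) transform described in the abstract, counting how often each grayscale value occurs and scaling by the image size, can be sketched as follows. This is an illustration only, not the authors' code: the function name `percent_per_image`, the 256-level assumption, and the toy image are hypothetical.

```python
from collections import Counter

def percent_per_image(pixels, levels=256):
    """Return a list of length `levels`: percent of pixels at each grayscale value.

    `pixels` is a list of rows of integer intensities in [0, levels).
    Spatial structure is discarded; only the value counts survive.
    """
    flat = [value for row in pixels for value in row]
    if not flat:
        raise ValueError("image has no pixels")
    if any(value < 0 or value >= levels for value in flat):
        raise ValueError("pixel value outside the expected grayscale range")
    counts = Counter(flat)
    total = len(flat)
    return [100.0 * counts.get(level, 0) / total for level in range(levels)]

# Toy 2x3 image (hypothetical); assumes 8-bit grayscale.
image = [[0, 0, 255], [128, 0, 255]]
ppi = percent_per_image(image)
```

Two images with the same pixels arranged differently produce the same PPI vector, which is what the abstract means by removing image structure.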
Full Text Available
Leapfrogging Medical AI in Low-Resource Contexts Using Edge Tensor Processing Unit

https://doi.org/10.1109/HI-POCT54491.2022.9744071

Sinha, Priyanshu; Gichoya, Judy W.; Purkayastha, Saptarshi (March 2022, 2022 IEEE Healthcare Innovations and Point of Care Technologies (HI-POCT))

With each passing year, state-of-the-art deep learning neural networks grow larger, requiring more computing and power resources. The high compute resources required by these large networks are alienating the majority of the world population that lives in low-resource settings and lacks the infrastructure to benefit from these advancements in medical AI. Current state-of-the-art medical AI, even with cloud resources, is difficult to deploy in remote areas that lack reliable internet connectivity. We demonstrate a cost-effective approach to deploying medical AI in limited-resource settings using the Edge Tensor Processing Unit (TPU). We trained and optimized a classification model on the Chest X-ray 14 dataset and a segmentation model on the Nerve ultrasound dataset using INT8 Quantization Aware Training. Thereafter, we compiled the optimized models for Edge TPU execution. We find that inference on Edge TPUs is 10x faster than on other embedded devices. The optimized models are 3x and 12x smaller for classification and segmentation, respectively, than the full-precision models. In summary, we show the potential of Edge TPUs for two medical AI tasks with faster inference times, which could potentially be used in low-resource settings for medical AI-based diagnostics. Finally, we discuss some potential challenges and limitations of our approach for real-world deployments.
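For readers unfamiliar with INT8 quantization, the basic operation it relies on, mapping floating-point values to 8-bit integers with a scale factor and back, can be sketched in plain Python. This is not the authors' training or compilation toolchain, and it omits the training loop entirely; the symmetric scaling scheme, the function names, and the sample weights are assumptions chosen for illustration.

```python
def quantize_int8(values):
    """Symmetric INT8 quantization: map floats to integers in [-127, 127].

    Returns the integer codes and the scale needed to recover approximate floats.
    """
    max_abs = max(abs(v) for v in values)
    # Use a scale of 1.0 for an all-zero input to avoid dividing by zero.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale

def dequantize(codes, scale):
    """Recover approximate floating-point values from INT8 codes."""
    return [c * scale for c in codes]

# Hypothetical weights; a real model would quantize each layer's tensor.
weights = [-1.0, 0.0, 0.3, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)
```

Quantization-aware training simulates this round trip during training so the model learns to tolerate the rounding error, which here is at most half of `scale` per value.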
Full Text Available
Margin-aware intraclass novelty identification for medical images

https://doi.org/10.1117/1.JMI.9.1.014004

Guo, Xiaoyuan; Gichoya, Judy W.; Purkayastha, Saptarshi; Banerjee, Imon (January 2022, Journal of Medical Imaging)

Full Text Available
